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Abstract. We study pushdown vector addition systems, which are syn¬ 
chronized products of pushdown automata with vector addition systems. 
The question of the boundedness of the reachability set for this model 
can be refined into two decision problems that ask if infinitely many 
counter values or stack configurations are reachable, respectively. Counter 
boundedness seems to be the more intricate problem. We show decid¬ 
ability in exponential time for one-dimensional systems. The proof is via 
a small witness property derived from an analysis of derivation trees of 
grammar-controlled vector addition systems. 


1 Introduction 

Pushdown vector addition systems are finite automata that can independently 
manipulate a pushdown stack and several counters. They are defined as syn¬ 
chronized products of vector addition systems with pushdown automata. Vector 
addition systems, shortly VAS, are a classical model for concurrent systems and 
are computationally equivalent to Petri nets. Formally, a fc-dimensional vector 
addition system is a finite set A C Z fe of vectors called actions. Each action 
a £ A induces a binary relation -—*■ over N fe , defined by c d if d = c + a. 

A fc-dimensional pushdown vector addition system, shortly PVAS , is a tuple 
( Q,r,qi n i t ,Ci n it,Wi n it,A ) where Q is a finite set of states, f is a finite stack 
alphabet, qmit £ Q is an initial state, Ci n n £ N fc is an initial assignment of the 
counters, Wi n n £ F* is an initial stack content, and A C Q x Z fc x Op(r) x Q is 
a finite set of transitions where Op(F) = {push(7), pop(7), nop | 7 £ r} is the 
set of stack operations. The size of VAS, PVAS (and GVAS introduced later) are 
defined as expected with numbers encoded in binary. 

Example 1.1. Consider the program on the left of Figure 1, that doubles the 
value of the global variable x. The * expression non-deterministically evaluates 
to a Boolean, as it is often the case in abstraction of programs [1]. On the right 
is a 1-dinrensional PVAS that models this procedure: states correspond to lines 
in the program code, operations on the variable x are directly applied, and the 
call stack is reflected on the pushdown stack. □ 

* This work was partially supported by ANR project ReacHard (ANR-11-BS02-001). 
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1 : x <— n 

2: procedure DoubleX 
3: if (★ A x > 0) then 

4: x <— (x — 1) 

5: DoubleX 

6: end if 

7: x <— (x + 2) 

8: end procedure 

Fig. 1. A PVAS modeling a recursive program. 

The semantics of PVAS is defined as follows. A configuration is a triple 
(g, c, w) £ Q x N fc x r* consisting of a state, a vector of natural numbers, and 
a stack content. The binary step relation —> over configurations is defined by 
( p,c,u ) -A (q,d,v) if there is a transition (p, op,a,g) £ A such that c —> d 
and one of the following conditions holds: either op = push( 7 ) and v = try, or 
op = pop(y) and u = uy, or op = nop and u = v. The reflexive and transitive 
closure of —> is denoted by A. 

The reachability set of a PVAS is the set of configurations (g, c, w) such 
that (qinit, Cinit, Win.it) —> (g, c, w). The reachability problem asks if a given 
configuration (g, c, w) is in the reachability set of a given PVAS. The decidability 
of this problem is open. Notice that for vector addition systems, even though the 
reachability problem is decidable [ 12 , 6 ], no primitive upper bound of complexity is 
known (see [9] for a first upper bound). However, a variant called the coverability 
problem is known to be ExpSPACE-complete [13,11]. Adapted to PVAS, the 
coverability problem takes as input a PVAS and a state q £ Q and asks if 
there exists a reachable configuration of the form ( q , c, w) for some c and w. The 
decidability of the coverability problem for PVAS is also open. In fact, coverability 
and reachability are inter-reducible (in logspace) for this class [7,10]. In dimension 
one, we recently proved that coverability is decidable [ 10 ]. 

Both coverability and reachability are clearly decidable for PVAS with finite 
reachability sets. These PVAS are said to be bounded. In [8], this class is proved to 
be recursive, i.e. the boundedness problem for PVAS is decidable. The complexity 
of this problem is known to be ToWER-hard [7]. The decidability is obtained 
by observing that if the reachability set of a PVAS is finite, its cardinality is at 
most hyper-Ackermannian in the size of the PVAS. Even though this bound is 
tight [8], the exact complexity of the boundedness problem is still open. Indeed, it 
is possible that there exist small certificates that witness infinite reachability sets. 
For instance, in the VAS case, the reachability set can be finite and Ackermannian. 
But when it is infinite, there exist small witnesses of this fact [13]. This yields 
an optimal [11] exponential-space algorithm for the VAS boundedness problem. 
Extending this technique to PVAS is a challenging problem. 

The boundedness problem for PVAS can be refined in two different ways. 
In fact, the infiniteness of the reachability set may come from the stack or the 
counters. We say that a PVAS is counter-bounded if the set of vectors c £ N fc 
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such that ( q , c, w) is reachable for some q and w, is finite. Symmetrically, a PVAS 
is called stack-bounded if the set of words w £ T* such that (q, c, w ) is reachable 
for some q and c, is finite. The following lemma shows that the two associated 
decision problems are at least as hard as the boundedness problem. 

Lemma 1.2. The boundedness problem is reducible in logarithmic space to the 
counter-boundedness problem and to the stack-boundedness problem (the dimen¬ 
sion k is unchanged by the reduction). 

The stack-boundedness problem can be solved by adapting the algorithm 
introduced in [8] for the PVAS boundedness problem. Informally, this algorithm 
explores the reachability tree and stops as soon as it detects a cycle of transitions 
whose iteration produces infinitely many reachable configurations. If this cycle 
increases the stack, we can immediately conclude stack-unboundedness. Otherwise, 
at least one counter can be increased to an arbitrary large number. By replacing 
the value of this counter by w and then resuming the computation of the tree from 
the new (extended) configuration, we obtain a Karp&Miller-like algorithm [5] 
deciding the stack-boundedness problem. We deduce the following result. 

Lemma 1.3. The stack-boundedness problem for PVAS is decidable. 

Concerning the counter-boundedness problem, adapting the algorithm intro¬ 
duced in [8] in a similar way seems to be more involved. Indeed, if we detect a 
cycle that only increases the stack, we can iterate it and represent its effect with 
a regular language. However, we do not know how to effectively truncate the 
resulting tree to obtain an algorithm deciding the counter-boundedness problem. 


Contributions. In this paper we solve the counter-boundedness problem for 
the special case of dimension one. We show that in a grammar setting, PVAS 
counter-boundedness corresponds to the boundedness problem for prefix-closed, 
grammar-controlled vector addition systems. We show that in dimension one, 
this problem is decidable in exponential time. Our proof is based on the existence 
of small witnesses exhibiting the unboundedness property. This complexity result 
improves the best known upper bound for the classical boundedness problem 
for PVAS in dimension one. In fact, as shown by the following Example 1.4, the 
reachability set of a bounded 1-dimensional PVAS can be Ackermannian large. 
In particular, the worst-case running time of the algorithm introduced in [8] for 
solving the boundedness problem is at least Ackermannian even in dimension 
one. 


Example l.f. The Ackermann functions A m : N —> N, for m £ N, are defined by 
induction for every n £ N by: 


A m (n) = 


n + 1 if m = 0 
e\(l) if m > 0 


These functions are weakly computable by the (family of) PVAS depicted in 
Figure 2, in the sense that: 


A m (n) = max{c | (±,n,j m ) A (_L,c,e)} 


(1) 
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Fig. 2. One-dimensional PVAS that weakly compute Ackermann functions. 


for every m,n £ N. Indeed, an immediate induction on k £ {0,... , to} shows 
that (_L,c, 7 fc) A (A, Ah(c),e) for every c £ N. For the converse inequality, let us 
introduce, for each configuration (A, c, w), the number 0(c, w) defined by 

0(c,7ii-"7* fc ) = A h °---°Ai k (c) 

An immediate induction on the number of times a run come back to the state 
A shows that (A,c, w) —> (A ,d,w') implies 9{c,w) > 9(d,w'). Since 9{c,e) = c, 
we derive that A m [n) > c for every c such that (A,n, y m ) A- (A, c, e). This 
concludes the proof of Equation (1). 

Notice that the reachability set of this PVAS is finite for any initial con¬ 
figuration. Indeed, (A, c, w) A (A ,d,w') implies 9{c,w) > 9(d,w') > d A \w'\. 
Therefore, there are only finitely many reachable configurations in state A. It 
follows that the same property holds for the other states. □ 

Outline. We recall some necessary notations about context-free grammars and 
parse trees in the next section. In Section 3, we present the model of grammar- 
controlled vector addition systems (GVAS) as previously introduced in [10], and 
reduce the counter boundedness problem for PVAS to the boundedness problem 
for the subclass of prefix-closed GVAS. We show in Section 4 that unbounded 
systems exhibit certificates of a certain form. Section 5 proves a technical lemma 
used later on and finally, in Section 6, we bound the size of minimal certificates 
and derive the claimed exponential-time upper bound. 

2 Preliminaries 

We let Z = ZU{— oo, +oo} denote the extended integers, and we use the standard 
extensions of + and < to Z. Recall that (Z, <) is a complete lattice. 

Words. Let A* be the set of all finite words over the alphabet A. The empty word 
is denoted by e. We write |io| for the length of a word w in A* and w k A ww ■ ■ ■ w 
for its fc-fold concatenation. The prefix partial order A over words is defined by 
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u A v if v = uw for some word w. We write u -< v if u is a proper prefix of v. A 
language is a subset L C A*. A language L is said to be prefix-closed if u A v 
and v £ L implies u £ L. 

Trees. A tree T is a finite, non-empty, prefix-closed subset of N* satisfying the 
property that if tj is in T then ti in T for all i < j. Elements of T are called 
nodes. Its root is the empty word e. An ancestor of a node t is a prefix s A t. A 
child of a node t in T is a node tj in T with j in N. A node is called a leaf if it 
has no child (i.e., fO ^ T), and is said to be internal otherwise. The size of a tree 
T is its cardinal |T|, its height is the maximal length |t| of its nodes t £ T. We 
let ^iex denote the lexicographic order on words in N*. 

Context-free Grammars. A context-free grammar is a quadruple G = ( V , A, R , S ), 
where V and A are disjoint finite sets of nonterminal and terminal symbols, 
S £ V is a start symbol, and R C V x (V U A)* is a finite set of production rules. 
We write 

X b ai | Q!2 | ... | ak 

to denote that (A', oq),..., (A", a*,) £ R. For all words w,w' £ (V U A)*, the 
grammar admits a derivation step w => w' if there exist two words u, v in 
(V U A)* and a production rule (A', a) in R such that w = uXv and w' = uav. 
Let ==> denote the reflexive and transitive closure of =>. The language of a 
word w in (V U A)* is the set L G = {z £ A* \ w =A> z}. The language of G is 
defined as L G , and it is denoted by L G . A nonterminal X £ V is called productive 
if L® 0- A context-free grammar G = (V, A, R, S) is in Chomsky normal form 3 
if, for every production rule (A, a) in R, either (A, a) = (S', e) or a £ V 2 U A. 

Parse Trees. A parse tree for a context-free grammar G = (V ,, A, R, S) is a tree 
T equipped with a labeling function sym : T —> (blldU {e}) such that the 
root is labeled by sym(e) = S and R contains the production rule sym(t) h 
sym(tO) ■ ■ ■ sym(tk ) for every internal node t with children tO, ..., tk. In addition, 
each leaf t e with sym(t) = e is the only child of its parent. Notice that 
sym(t) £ V for every internal node t. A parse tree is called complete when 
sym(t) £ (dU {e}) for every leaf t. The yield of a parse tree (T, sym) is the word 
sym{t\) ■ ■ ■ sym(te) where ti,...,te are the leaves of T in lexicographic order 
(informally, from left to right). Observe that for every word w in (V U A)*, it 
holds that S w if, and only if, w is the yield of some parse tree. 

3 Grammar-Controlled Vector Addition Systems 

In this section we recall the notion of GVAS from [10] and show that the 
boundedness problem for the subclass of prefix-closed GVAS is inter-reducible to 
the counter-boundedness problem for pushdown vector addition systems. 

3 To simplify the presentation, we consider a weaker normal form than the classical 
one, as we allow to reuse the start symbol. 
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Definition 3.1 (GVAS). A k -dimensional grammar-controlled vector addition 
system (shortly, GVAS,) is a tuple G = (V, A, R, S, Ci n i t ) where ( V,A,R,S ) is a 
context-free grammar, A C Z fc is a VAS, and c init £ is an initial vector. 

The semantics of GVAS is given by extending the relations -—>■ of ordinary 
VAS to words over V U A as follows. Define —A to be the identity on N fc and 
let —> = —> o —y for z £ A and a £ A. Finally, let —> = \J z( z L g —> for 
w £ (V U A)*. For a word z = a±a 2 ■ ■ ■ a n £ A* over the terminals, we shortly 
write Y) z f° r the sum a,;. Observe that c —^ d implies d — c = z. 

s 

Ultimately, we are interested in the relation —K that describes the reachability 
relation via sequences of actions in Lg, i.e., those that are derivable from the 
starting symbol S in the underlying grammar. A vector d £ N fc is called reachable 
from a vector c £ if c —^ d. The reachability set of a GVAS is the set of 
vectors reachable from c™ t . 

A GVAS is said to be bounded if its reachability set is finite. The associated 
boundedness problem for GVAS is challenging since the coverability problem 
for PVAS, whose decidability is still open, is logspace reducible to it. However, 
the various boundedness properties that we investigate on PVAS (see Section 1) 
consider all reachable configurations, without any acceptance condition. So they 
intrinsically correspond to context-free languages that are prefix-closed, ft is 
therefore natural to consider the same restriction for GVAS. Formally, we call a 
GVAS G = (' V , A , R, S, C lm t) prefix-closed when the language Lg is prefix-closed. 
Concerning the counter-boundedness problem for PVAS, the following lemma 
shows that it is sufficient to consider the special case of prefix-closed GVAS. 

Lemma 3.2. The counter-boundedness problem for PVAS is logspace inter- 
reducible with the prefix-closed GVAS boundedness problem (the dimension k is 
unchanged by both reductions). 

In this paper, we focus on the counter-boundedness problem for PVAS of 
dimension one. We show that this problem is decidable in exponential time. The 
proof is by reduction, using Lemma 3.2, to the boundedness problem for prefix- 
closed 1-dimensional GVAS. Our main technical contribution is the following 
result. 

Theorem 3.3. The prefix-closed 1-dimensional GVAS boundedness problem is 
decidable in exponential time. 

For the remainder of the paper, we restrict our attention to the dimension 
one, and shortly write GVAS instead of 1-dimensional GVAS. 

Example 3.). Consider again the Ackermann functions A m introduced in Exam¬ 
ple 1.4. These can be expressed by the GVAS with nonterminals X 0 ,..., X m and 
with production rules A' 0 h 1 and Xi b —1 X, X i _ 1 | lX t _i for 1 < i < m. It is 
routinely checked that max{d | c " ■ - ■ > d} = A m (c) for all cgN. □ 
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Every GVAS can be effectively normalized, in logarithmic space, by replacing 
terminals a £ Z by words over the alphabet {—1,0,1} and then putting the 
resulting grammar into Chomsky normal form. In addition, non-productive 
nonterminals, and production rules in which they occur, can be removed. So in 
order to simplify our proofs, we consider w.l.o.g. only GVAS of this simpler form. 

Assumption. We restrict our attention to GVAS G = (V, A, R, S,Ci n it) in Chom¬ 
sky normal form and where A = {—1,0,1} and every X £ V is productive. 

The rest of the paper is devoted to the proof of Theorem 3.3. Before delving 
into its technical details, we give a high-level description the proof. In the 
next section, we characterize unboundedness in terms of certificates, which are 
complete parse trees whose nodes are labeled by natural numbers (or —oo). These 
certificates contain a growing pattern that can be pumped to produce infinitely 
many reachable (1-dimensional) vectors, thereby witnessing unboundedness. We 
then prove that certificates need not be too large. To do so, we first show in 
Section 5 how to bound the size of growing patterns. Then, we bound the 
height and labels of “minimal” certificates in Section 6. Both bounds are singly- 
exponential in the size of the GVAS. Thus, the existence of a certificate can be 
checked by an alternating Turing machine running in polynomial space. This 
entails the desired ExpTime upper-bound stated in Theorem 3.3. 

4 Certificates of Unboundedness 

Following our previous work on the GVAS coverability problem [10], we annotate 
parse trees in a way that is consistent with the VAS semantics. A flow tree for a 
GVAS G = ( V , A, R, S, Cinit) is a complete 4 parse tree (T, sym) for G equipped 
with two functions in , out :T->NU {—oo}, assigning an input and an output 
value to each node, with in(e) = Ci n u , and satisfying, for every node t £ T, the 
following flow conditions: 

1. If t is internal with children fO,... ,tk, then in(t0) < in{t), out{t) < out(tk), 
and in(t(j + 1)) < out(tj ) for every j = 0,..., k — 1. 

2. If t is a leaf, then out(t) < in(t) + a if sym(t ) = a £ A, and out(t) < in(t) if 
sym(t) = e. 

We shortly write t : c#d to mean that (in(t ), sym(t ), out(t)) = (c, #,d). The 
size of a flow tree is the size of its underlying parse tree. Figure 3 (left) shows 
a flow tree for the GVAS of Example 3.4, with start symbol X\ and initial 
(1-dimensional) vector = 5. 

Remark 4-1. The flow conditions enforce the VAS semantics along a depth-first 
pre-order traversal of the complete parse tree. But, as in [10], we only require 
inequalities instead of equalities. This corresponds to a lossy VAS semantics, where 

4 Compared to [10] where flow trees are built on arbitrary parse trees, the flow trees 
that we consider here are always built on complete parse trees. 
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the counter can be non-deterministically decreased [2] . The use of inequalities 
in our flow conditions simplifies the presentation and allows for certificates of 
unboundedness with smaller input/output values. Note that equalities would be 
required to get certificates of reachability, but the latter problem is out of the 
scope of this paper. 

C< 

Lemma 4.2. For all d with Ci n n —> d, there exists a flow tree with out(e) = d. 

Our main ingredient to prove Theorem 3.3 is a small model property. First, we 
show in this section that unboundedness can always be witnessed by a flow tree of 
a particular form, called a certificate (see Definition 4.5 and Figure 3). Then, we 
will provide in Theorem 6.10 exponential bounds on the height and input/output 
values of “minimal” certificates. This will entail the desired ExpTime upper- 
bound for the prefix-closed GVAS boundedness problem. 

We start by bounding the size of flow trees that do not contain an iterable 
pattern, i.e., a nonterminal that repeats, below it, with a larger or equal input 
value. Formally, a flow tree (T, sym, in, out) is called good if it contains a node t 
and a proper ancestor s -< t such that sym(s) = sym(t) and in(s) < in(t). It is 
called bad otherwise. We bound the size of bad flow trees by (a) translating them 
into bad nested sequences, and (b) using a bound given in [8] on the length of 
bad nested sequences. Let us first recall some notions and results from [8] . Our 
presentation is deliberately simplified and limited to our setting. 

Let (S, be the normed quasi-ordered set defined by S = V x N, 

(X,m) A (Y,n) <=>■ X = Y A m < n, and ||(X, m)\\ = m. A nested sequence 
is a finite sequence (si, hi ),..., {se, he) of elements in S' x N satisfying hi = 0 
and hj + i £ hj + {—1,0,1} for every index j < £ of the sequence. A nested 
sequence (si, hi), ..., ( se, he) is called good if there exists i < j such that Si A Sj 
and hi < h i+ i,... ,hj. A bad nested sequence is one that is not good. A nested 
sequence (si, hi ),..., ( se, he) is called n-controlled, where n £ N, if ||sj|| < n + j 
for every index j of the sequence. 

Theorem 4.3 ([8, Theorem VI.1]). Let n £ N with n > 2. Every n-controlled 
bad nested sequence has length at most F u \y)n). 

The function F u \y\ : N —> N used in the theorem is part of the fast-growing 
hierarchy. Its precise definition (see, e.g., [8]) is not important for the rest of 
the paper. The following lemma provides a bound on the size of bad flow trees. 
Notice that this lemma applies to arbitrary GVAS (not necessarily prefix-closed). 

Lemma 4.4. Every bad flow tree has at most F w \v\( c init + 2) nodes. 

A good flow tree contains an iterable pattern that can be “pumped”. However, 
the existence of such a pattern does not guarantee unboundedness. For that, we 
need stronger requirements on the input and output values, as defined below. 

Definition 4.5 (Certificates). A certificate for a given GVAS is a flow tree 
(T, sym, in, out) equipped with two nodes s -< t in T such that 

sym(s) = sym(t) and in(s) < in(t) and in(s) < in(t) or out(t) < outfs) 
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Fig. 3. Left: a flow tree for the GVAS of Example 3.4 with a-n.it = 5. Input and 
output values are indicated in red and blue, respectively. Right: A certificate with 
sym(t) = sym(s) = X and yield xuwvy £ A*. It must hold that either in(s) < in(t) or 
in(s) = in(t) and out(t) < out(s). 


We now present the main result of this section, which shows that unbounded¬ 
ness can always be witnessed by a certificate. 

Theorem 4.6. A prefix-closed GVAS G is unbounded if, and only if, there exists 
a certificate for G. 

5 Growing Patterns 

Certificates depicted on Figure 3 (right) introduce words u £ A* satisfying a 
sign constraint ^ u > 0 or u = 0. These words are derivable from words of 
non-terminal symbols Si ... Sk corresponding to the left children of the nodes 
between s and t. In order to obtain small certificates, in this section, we provide 
bounds on the minimal length of words v! £ A* that can also be derived from 
S\ ... Sk and that satisfy the same sign constraint as u. 

Let us first introduce the displacement of a GVAS G as the “best shift” 
achievable by a word in Lr and defined by the following equality 5 : 

A g = sup{J>|z£L G } 

When the displacement is finite, the following Lemma 5.1 shows that it is 
achievable by a complete elementary parse tree. We say that a parse tree T is 
elementary if for every s A t such that sym(s) = sym{t- ), we have s = t. Notice 
that the size of an elementary parse tree is bounded by 2^' I+ 1 . 

Lemma 5.1. Every GVAS G admits a complete elementary parse with a yield 
w such that A G £ w, +oo}. 


5 Notice that A G may be negative. 
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Given a non-terminal symbol A, we denote by G[A'] the context-free grammar 
obtained from G by replacing the start symbol by A'. We are now ready to state 
the main observation of this section. 

Theorem 5.2. For every sequence S\,.... S k of non-terminal symbols of a 
GVAS G there exists a sequence T\,...,T k of complete parse trees Tj for Gj = 
G[Sj] with a yield Zj such that |Ti| + • • • + \T k \ < "5k4) v \ +1 , and such that 
Yfzi...z k >0 if A Gl -|- \-A Gk > 0, andj^zi... z k = 0 */Z\ Gl H-h A Gk = 0. 

We first provide bounds on complete parse trees that witness the following 
properties A G = +oo and A is derivable. Formally, a nonterminal X is said to be 
derivable if there exists w £ (A U V)* that contains X and such that S ===>■ w. 

Lemma 5.3. If A G = +oo, there exists a parse tree for G[X] where X is a 
non-terminal symbol derivable from the start symbol S with a yield uXv satisfying 
u,v £ A*, Y uv > 0) and a number of nodes bounded by 4^l +1 . 


Lemma 5.4. For every derivable non-terminal symbol A, there exists a parse 
tree with a yield in A*XA* and a number of nodes bounded by 4) v \ +1 . 

Proof (of Theorem 5.2). We can assume that k > 1 since otherwise the proof is 
trivial. Observe that if A Gl + • • • + A Gk < +oo then A Gj < +oo for every j. It 
follows from Lemma 5.1 that there exists a complete parse tree Tj for G[Sj] with a 
yield Wj satisfying A Gj = Y w j and a number of nodes bounded by 2^ l +1 . Thus 
|ri|+--- + |r fc | < k2^ l +1 and Y w i ■ ■ ■ w k = 4\ Gl + • • • + A Gk . So, in this special 
case the theorem is proved. Now, let us assume that A Gl + • • • + A Gk = +oo. 
There exists p £ {1,..., k} such that A Gp = +oo. Lemma 5.3 shows that there 
exists a variable for X derivable from S p and a parse tree T + for G[A'] with a yield 
uXv satisfying it, v £ A*, Y uv > 0, and such that |T + | < 4l v l +1 . Since Sj is 
productive, there exists a complete elementary parse tree Tj for G[Sj] with a yield 
Wj £ A*. For the same reason, there exists a complete elementary parse tree T for 
G[A] with a yield w £ A*. As X is derivable from S, Lemma 5.4 shows that there 
exists a parse tree T' for G with a yield labeled by a word in v!Xv' with u',v' £ A*, 
and a number of nodes bounded by 4) v \ +1 . Notice that for any n £ N, we deduce 
a complete parse tree T p for G[S p ] with a yield w p = u'u n wv n v' by inserting 
in T' many (n) copies of T + and one copy of T. Observe that Y w i ■ ■ ■ w k > 
— |wi... w p - iw p+ i... w k \ — \u'wv'\ + nY) uv > —k2' [V \ +1 — 4l v l +1 + n. Let us 
Hx n to 2fc4^ l +1 — 2. It follows that Y w i ■ ■ ■ w p >0. Moreover, we have \T p \ < 
|T| — l + n(|T+| — 1) + |T'| < 2l y l +1 +n4l y l +1 +4l v l +1 < [n+ 2)4l y l +1 < 2fc4l y l +1 . 
We derive |2i| + • • • + \T k \ < (k — l)2^ v ^ +1 + 2k4) v ^ +1 < 3k4) x l +1 . We have proved 
Theorem 5.2. □ 


6 Small Certificates 

We provide in this section exponential bounds on the height and input/output 
values of minimal certificates in the following sense. Let the rank of a flow tree 
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(T, sym, in, out ) be the pair 


\T in \ + |T ou t| , (*) + oui ( i )) 

t£T ou t 

where T in = {t £ T \ in(t) > — 00 } and T out = {t £ T \ out(t) > — 00 }. Notice 
that T out C T ln . We compare ranks using the lexicographic order A lex over N 2 
and let the rank of a certificate (T,s,t) be the rank of its flow tree T. 

Consider a prefix-closed GVAS G = (V , A, R, S, Omit) that is unbounded. By 
Theorem 4.6, there exists a certificate for G. Pick a certificate (T, s, t) among 
those of least rank. Our goal is to bound the height and input/output values 
of T. Based on its assumed minimality, we observe a series of facts about our 
chosen certificate. 

First, we observe that some input/output values in T must be — 00 , because 
higher values would be useless in the sense that they can be set to —00 without 
breaking the flow conditions nor the conditions on s and t. This observation is 
formalized in the two following facts. 

Fact 6.1. It holds that out(p) = —00 for every proper ancestor p -< s. Moreover, 
in(p) = out(p ) = —00 for every node p £ T such that s Ai ex P and p ^ s. 

Fact 6.2. Assume that in(s) < in(t). It holds that out(j> ) = —00 for every 
ancestor p <t. Moreover, in(p) = out(p ) = —00 for all p £ T with t ^i ex P- 

Next, we observe that the main branch, that contains s and t , must be short. 

Fact 6.3. It holds that |s| < \V\ and |t| < |s| + \V\ + 1. 

The next two facts provide relative bounds on input and output values for 
nodes that are not on the main branch. 

Fact 6-4- It holds that in(p) < out(p) + 21 v " I for every node p £T with p y^t. 


Fact 6.5. Let q £ T and let p be the parent of q. If p = t or p ^ t, then 
out(q) < out(p ) + 2^ v I. If moreover sym(p) = sym(q ), then out(q) < out{p). 

The following facts provide absolute bounds on the input/output values of 
nodes s and t. The proofs of the facts below crucially rely on Section 5. Consider 
the subtrees on the left and on the right of the branch from s to t. The main 
idea of the proofs is to replace these subtrees by small ones using Theorem 5.2. 

Fact 6.6. It holds that out(t) < out(s) < 6\V\ ■ 4l y l +1 . 


Fact 6.7. It holds that in(s) < in(t) < 7\V\ ■ 
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Now we derive absolute bounds for the input/output values of the remaining 
nodes on the main branch. These are derived from Facts 6.6 and 6.7, using 
Facts 6.4 and 6.5 about the way in/output values propagate and the Fact 6.3 
that the intermediate path between nodes s and t is short. 

Fact 6.8. It holds that out(p) < 4 2 ^ v for every ancestor p <t. 


Fact 6.9. It holds that in(p ) < 4 2 ^ v l +1 ) for every ancestor £ -< p A t. 

We are now ready to derive bounds on the rank of our minimal certificate. 
Notice that it remains only to bound the depth and the input/output values on 
branches different from the main branch. 

Consider therefore a node q outside the main branch, i.e., q-£t. Let p be the 
least prefix of q such that p = t or p -ff t. We first show that out(p) < 4 2 6 v l +1 h If 
p = t then the claim follows from Fact 6.8. Otherwise, the parent r of p satisfies 
r A t. Observe that the other child p of r satisfies p A t. The flow conditions 
together with the minimality of (T, s, t) guarantee that 

— if p = r 1 then out(p ) = out(r), hence, out(p ) < 4 2 ^ v l +1 i by Fact 6.8, and 

— if p = rO then out(p ) = m(rl), hence, out(p) < 4 2 C^1+!) by Fact 6.9. 

According to Fact 6.5, the output values on the branch from p down to q may 
only increase when visiting a new symbol. Moreover, this increase is bounded 
by 21 1 I. It follows that out(r) < out[p) + |F|2l' I for every node r such that 
p <r <q. Fact 6.4 entails that m(r) < out(p) + (|F| + 1) 21' I. We obtain that 
max{m(r), owf(r)} < 4 3 ^ v l +1 ) for every node r with p -< r A q. Fact 6.5 also 
forbids the same nonterminal from appearing twice with the same output value, 
so |r| < \p\ + \V\ ■ 4 3 (l' / l+ 1 ) + 1. Observe that \p\ < |t|. We derive from Fact 6.3 
that |r| < 4l 4 ‘U v \ +1 \ This concludes the proof of the following theorem. 

Theorem 6.10. A prefix-closed GVAS (V, A, R, S,Ci n u) is unbounded if, and 
only if it admits a certificate with height and all input/output values bounded by 
a„u + 4^ v i +1 ). 

Proof (of Theorem 3.3). By Theorem 6.10, a certificate for unboundedness is a 
flow tree of exponential height and with all input and output labels exponentially 
bounded. An alternating Turing machine can thus guess and verify all branches of 
such a flow tree, storing intermediate input/output values as well as the remaining 
length of a branch in polynomial space. The claim then follows from the fact 
that alternating polynomial space equals exponential time. □ 

7 Conclusion 

We discussed different boundedness problems for pushdown vector addition 
systems [8,7], which are a known, and very expressive computational model that 
features nondeterminism, a pushdown stack and several counters. These systems 
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may be equivalently interpreted, in the context of regulated rewriting [3], as 
vector addition systems with context-free control languages. 

We observe that boundedness is reducible to both counter- and stack-boundedness. 
The stack boundedness problem can be shown to be decidable (with hyper- 
Ackermannian complexity) by adjusting the algorithm presented in [8]. 

Here, we single out the special case of the counter-boundedness problem for 
one-dimensional systems and propose an exponential-time algorithm that solves 
it. This also improves the best previously known Ackermannian upper bound for 
boundedness in dimension one. 

Currently, the best lower bound for this problem is NP, which can be seen by 
reduction from the subset sum problem. For dimension two, PSPACE-hardness 
follows by reduction from the state-reachability of bounded one-counter automata 
with succinct counter updates [4]. For arbitrary dimensions, TowER-hardness 
is known already for the boundedness problem [7,8] but the decidability of 
counter-boundedness for PVAS remains open. 
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A Missing Proofs 

A.l Proofs for Section 1 

Lemma 1.2. The boundedness problem is reducible in logarithmic space to the 
counter-boundedness problem and to the stack-boundedness problem (the dimen¬ 
sion k is unchanged by the reduction). 

Proof. Let us consider a PVAS A and let us introduce a PVAS A' such that A is 
bounded if, and only if, A' is bounded, and such that if A is unbounded then A! 
is both counter-unbounded and stack-unbounded. The system A' is a copy of A, 
extended with a new state _L. The state ± has self-loops that allow to pop any 
symbol from the stack and simultaneously increment the first counter. It also has 
self-loops that decrease any counter and push some symbol to the stack. Finally, 
we add a transition ( q , 0, nop, _L) for each original state q. Now just observe that 
A is bounded if, and only if, A' is bounded. Moreover if A is unbounded, then 
A! is both counter-unbounded and stack-unbounded. □ 

A.3 Proofs for Section 3 

Lemma 3.2. The counter-boundedness problem for PVAS is logspace inter- 
reducible with the prefix-closed GVAS boundedness problem (the dimension k is 
unchanged by both reductions). 

Proof. Just observe that a PVAS can be interpreted as a pushdown automaton 
that recognizes a context-free and prefix-closed trace language L C A* where 
AC Z is the set of vectors labeling the transitions. We can construct, in 
logarithmic space, a context-free grammar that produces L. This context-free 
grammar (equipped with the initial value Ci n u of the PVAS) is a prefix-closed 
GVAS G. The reachability set of G is exactly the set of vectors c such that 
(g, c, w) is a reachable configuration of the PVAS for some q and w. The converse 
construction follows a similar idea by observing that the language of a prefix- 
closed context-free grammar can be accepted by a pushdown automaton (with 
all states accepting), computable in logarithmic space. □ 

Lemma A.l. Let G = (V , A, R, S, Cmit) be a GVAS with max{|a| : a € A} < n. 
One can construct, in logspace, an equivalent GVAS G' = {V', A', R!, S', Ci n u) 
with the same reachability set and such that A! = {—1,0,1}. 

Proof. G' will be a copy of G, extended as follows. For all 1 < m < n, there are 
new nonterminals B m and rules B\ b 1 and B m b for m > 0. Now 

all terminals a G A such that a > 1 are removed from A' and replaced (on right 
hand sides of all rules) by a new nonterminal X a . The only rule that rewrites 
this symbol is 

X a \~ B^B^f.-.B^ (2) 

where a = .. .b i is the binary representation of a. Note that B is the 

empty word if b m = 0 and B m otherwise. We thus observe that the language 
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contains only the word l a . In particular, for any c, d £ N we get that c —> d iff 

x a 

c ->■ a. 

An analogous construction allows to replace all terminals a £ A with a < 1. 
The resulting GVAS G' has terminal alphabet A' = {—1,0,1} as required. □ 


A.4 Proofs for Section 4 

S 

Lemma 4.2. For all d with Ci n u —> d, there exists a flow tree with out(e) = d. 

Proof. Assume that c lnlt —> d. It holds that Ci„u —A d for some z £ Ls- Since 
z £ L$, there exists a derivation S => z , hence, a complete parse tree with root 
labeled by S and yield This complete parse tree, together with the fact that 
Cinit —A d, induces a flow tree with root e : Ci n uSd. □ 

Lemma 4.4. Every bad flow tree has at most F u .\v\(cinit + 2) nodes. 

Proof. Let T = (T, sym, in, out) be a flow tree. We construct a nested sequence 
that corresponds to a depth-first pre-order traversal of the flow tree. Let us 
introduce, for each symbol X £ V, two copies X' and X". We associate to each 
node t £ N* a word 9{t) over SxN, inductively defined as follows: 

6{t) = if to £ T 

1 e otherwise 

where A' = sym{t ) and to = in(t). Recall that the condition tO £ T means that t 
is an internal node of T. It is readily seen that 0(e) is a nested sequence. Let us 
write it as 0(e) = (si, hi ),..., (se, he). Obviously, every index 1 < i < £ can be 
mapped back to a node t(i) of the flow tree T. 

Assume that T is bad, and suppose, towards a contradiction, that 0(e) is good. 
So there exists i < j such that Si A Sj and hi < h i+ 1 ,... ,hj. This entails that 
t[i) is an ancestor of t(j). Moreover, t(i) ^ t{j) because each node t £ T is visited 
three times in the sequence 0(e), and each visit uses a different copy of sym[t). So 
t(i) is a proper ancestor of t(j). Since Sj A sj, we get that symftfi)) = sym(t(j)) 
and in(t(i)) < in(t(j)), which contradicts our assumption that T is bad. 

We have shown that the nested sequence 0(e) is bad. Let us show that 
0(e) is Cinit-controlled. Let 1 < j < t. Since A = {—1,0,1} by assumption, it 
holds that in(t(j)) < c ml t + L where L denotes the number of leaves that are 
lexicographically smaller than t(j). Recall that G is in Chomsky normal form by 
assumption. So these leaves have distinct parents, and those are all visited before 
t(j) in the sequence 0(e), hence, j > L. It follows that ||sj|| = in(t(j)) < c init +j. 

Since the nested sequence 0(e) is ( c init + 2)-controlled and bad, we derive 
from Theorem 4.3 that its length i satisfies t < F u .\v\( c init +2). The observation 
that |T| < t concludes the proof. □ 

Theorem 4.6. A prefix-closed GVAS G is unbounded if, and only if, there exists 
a certificate for G. 
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Proof. Assume that the reachability set of G is infinite. So there exists d such that 
Cinit —> d and d > c mzt + F u .\ v \ (c lmt + 2). Pick a flow tree T = (T, sym, in, out) 
with root e : Ci n i t Sd, among those of least size. Note that such a flow tree exists 
by Lemma 4.2. Let z £ A* denote the yield of the complete parse tree ( T, sym). 
It is readily seen that Cmit —A e for some e > d. Recall that A = {—1,0,1} by 
assumption. It follows that |z| > = e — Ci n it > F u .\v\(ci n it +2). Observe that 

|T| > |z| since z is the yield of T. It follows from Lemma 4.4 that the flow tree 
(T, sym, in, out) is good. So it contains a node t and a proper ancestor s -< t such 
that sym(s) = sym{t) and in(s) < in(t). To prove that ( T,s,t) is a certificate 
for G, it suffices to show that in(s) < in(t) or out(t) < out(s). Assume, by 
contradiction, that in(s) = in{t) and out(t) > out(s). We may replace, without 
breaking the flow conditions, the subtree rooted in s by the subtree rooted in t. 
We may even preserve the input and output of s. The resulting flow tree also has 
root e : CinitSd, but it has less nodes than 7”, which contradicts the minimality 
of T. 

Conversely, assume that there exists a certificate (T, s, t) for G, with T = 
(T, sym, in, out). By definition, it holds that s -< t, sym(s) = sym(t), and either 
m(s) < in{t) or m(s) = in{t) and out(t) < out(s). Let X denote the common 
nonterminal X = sym(s) = sym(t). We decompose the yield z £ A* of the 
complete parse tree (T, sym) into z = xuwvy, as depicted in Figure 3, where: 

— x and y come from the leaves that are lexicographically smaller and larger 
than s, respectively, 

— u and v come from the leaves of the subtree rooted in s that are lexicograph¬ 
ically smaller and larger than t, respectively, and 

— w comes from the leaves of the subtree rooted in t. 

It is readily seen that S ==> xXy, X => uXv and X ==> w. Since Ls is prefix- 
closed, we get that {xu n \ n £ N} C Ls and {xu n wv n \ n £ N} C Ls- Observe 
that in(t) < in(s) + ^u and out\s) < out(t) + Yfv. These two inequalities follow 
from the flow conditions. There are two cases. 

1. Either in(s) < in(t), in which case > 0. Since —oo < in(t), it holds that 

Cmit ——> d for some d. We derive that Ci n u —^ c > c + n ■ u for every 

s 

neN, where c = Ci n u + ^x. Since u > 0, we derive that {d \ Ci n u —> d} 
is infinite. 

2. Or in(s) = in{t) and out(t) < outfs), in which case ^uPO and > 0. 
Since —oo < out(s), it holds that Ci nlt luwv > d f or gome d. We derive that 
Cinit XU W > c+n-^f) u c - uv f° r every n £ N, where c = Cmit+Y.Z xw ■ 
Since Yh uv > 0, we derive that {d | Cj„j t —»• d} is infinite. 

In both cases, we obtain that the reachability set of G is infinite. □ 

A.5 Proofs for Section 5 

Lemma 5.1. Every GVAS G admits a complete elementary parse with a yield 
w such that A G £ w, +oo}. 
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Proof. Notice that if A G < +oo there exists a complete parse tree with a yield 
w such that Y w = A G . Since G admits a complete parse tree, we can pick a 
complete parse tree T that is minimal wrt. the number of nodes and with a 
yield w that satisfies A G £ {£ w, + 00 }. Assume by contradiction that T is not 
elementary. In that case, there exist two distinct nodes s A t that are labeled by 
the same non-terminal symbol X £ V. Notice that if A G = + 00 , by collapsing 
in T the nodes s and t, we get a parse tree T' with a yield w' that naturally 
satisfies A G £ {£ w', + 00 }. Thus T' contradicts the minimality of T. We deduce 
that A G < +00 and in particular £ te = A G . Let us decompose the yield w as 
w = aubvc, where the subwords u, v derive from the pumpable path from node 
s to t. If £ u + X) v > 0, by inserting many copies of this subtree in T, we get 
A G = + 00 , which is impossible. It follows that + < 0. By collapsing the 

nodes s and t. we get a complete parse tree V for G such that \T'\ < |T| with a 
yield w' satisfying Y w ' = £ w ~ (£ w + £ From £ tt + £ v < 0 we derive 
£ w ' > £ ^- As £ w — A G and £ w’ < A G , we derive A G = £ w ' and we get 
a contradiction on the minimality of T. It follows that T is elementary. □ 

Lemma 5.3. If A G = + 00 , there exists a parse tree for G[X] where X is a 
non-terminal symbol derivable from the start symbol S with a yield uXv satisfying 
u,v £ A*, Y uv > 0, and a number of nodes bounded by 4J v l +1 . 

Proof. Let us first prove that there exists a non-terminal symbol X derivable 
from the start symbol S and a parse tree for G[X] with a yield uXv satisfying 
u, v £ A* and £ uv > 0- Since A G = + 00 , there exists a minimal (for the number 
of nodes) complete parse tree T with a yield w satisfying Y w > 2^ v L Observe 
that if T is elementary then |tt>| < 2^ v I and in particular Y w — 2^ and we get 
a contradiction. So the tree T is not elementary. Hence there exists s -< t in T 
with sym(s) = X = sym{t ) for some non-terminal symbol X. The subtree of T 
between s and t provides a parse tree for G[X\ with a yield uXv where u,v £ A*. 
If Y uv ^ 0 by collapsing nodes s and t in T, we derive a complete parse tree 
T' such that \T’\ < |T| with a yield w' satisfying Y w ' + J2 UV = Thus 

Y w' > 2d' I and we get a contradiction on the minimality of |T|. Thus Y uv > 0. 

In the previous paragraph, we have proved that there exists a non-terminal 
symbol X derivable from the start symbol S and a parse tree T for G[X\ with a 
yield uXv satisfying u,v £ A* and Y uv > 0- Without loss of generality, we can 
pick X and T is such a way |T| is minimal. Let t be the unique leaf of T labeled by 
X and assume by contradiction that |t| > |V|. In this case, there exists r A s A t 
such that sym{r ) = X' = sym(s) for some non-terminal symbol X'. Notice that 
in that case the subtree between r and s is a parse tree T' for G[X'] with a yield 
u'X'v 1 where u',v' £ A*. Since \T'\ < |T|, by minimality of T, we get Y u ' v ' — 0- 
In particular, by collapsing in T the nodes s and t we get a parse tree T" for 
G[X] with a yield u"Xv" such that Y u " v " + E u'v' = Y uv. Thus £ u"v" > 0. 
We get a contradiction on the minimality of |T| since \T"\ < \T\. Hence |t| < \V\. 
Symmetrically, observe that if there exists r -< s such that sym(r) = X' = sym(s) 
for some non-terminal symbol X' and such that r ^ t then we get a contradiction 
on the minimality of T. Therefore T can be decomposed as a branch for the 
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root to t in such a way nodes on this branch, except t , have exactly, on the 
right or on the left an elementary subtree. Thus \T\ < \V\ + 1 + |F|2l v I+ 1 . Since 
\V\ + l< 2l v l+\ we get \T\ < 2l y l +1 + \V\2\ v \ +l = (\V\ + l)2l v 'l +1 < 4l y l +1 . 

□ 

Lemma 5.4. For every derivable non-terminal symbol X, there exists a parse 
tree with a yield in A*XA* and a number of nodes bounded by 4) v ' +1 . 

Proof. Since X is derivable from S, there exists a sequence Xq. ..., X^ of non¬ 
terminal symbols with A' 0 = S', Xj~ = X, k + 1 < \V\, and a sequence of 
production rules Xj _i b ajXjf3j with ctj/3j = Yj for some non-terminal symbol 
Yj. Notice that there exists a complete elementary parse tree Tj for G\Yj\. 
The parse trees Ti, ... ,Tk put along a branch labeled by Xq, ... , Xk provide 
a parse tree for G with a yield in A* XA* and a number of nodes bounded by 
(fc + 1) + fc2l y l +1 < |V| + (|F| - l)2l y l +1 < |l/|2l y l +1 < 4l y l +1 . □ 


A.6 Proofs for Section 6 

Fact 6.3. It holds that |s| < \V\ and |f| < |s| + \V\ + 1. 

Proof. Suppose, towards a contradiction, that |s| > \V\. There must exit two 
nodes p -< q -< s with sym(p) = sym{q). If in(p) < in(q) then we get a certificate 
(T 7 ,p, q) of strictly smaller rank than (fT,s,t) by setting the input value of t 
to —oo and propagating onwards. If in(p) > in{q) then we may replace the 
subtree rooted in p by the subtree rooted in q , retaining the flow conditions 
since out(p ) = out(q) = — oo by Fact 6.1, and thus get a certificate of strictly 
smaller rank than (T, s, t). Both cases contradict the minimality of (T, s, t). This 
concludes the proof that |s| < \V\. 

Now suppose, towards a contradiction, that |f| > |s| + \V\ + 1. There exists 
necessarily two nodes s -< p -< q -< t with sym{p) = sym(q). If in(p) < in(q) 
then we get a certificate (T',_p, q) of strictly smaller rank than (T, s, t) by setting 
the input value of t to —oo and propagating onwards. Similarly, if in(p) = in(q) 
and out{q) < outfp) then we get a certificate (T',p,q) of strictly smaller rank 
than (T, s, t) by setting the output value of s to —oo and propagating onwards. 
If in(p) = in{q) and out(q) > out(p) then we may replace the subtree rooted 
in p by the subtree rooted in g, and thus get a certificate of strictly smaller 
rank than (' T,s,t ). The remaining case is when in(p) > in(q). In that case, we 
collapse the nodes p -< q, preserve the input value of p, and relabel all nodes 
lexicographically larger than p with the largest input/output values allowed by 
the flow conditions. In the resulting flow tree T ', which has a smaller rank than 
T, the node t' originating from t has a strictly larger input value than s. So 
(T',s,t') is a certificate. All cases contradict the minimality of (T, s,f). This 
concludes the proof that |t|<|s| + |F| + l. □ 

Fact 6-4- It holds that in(p) < out{p) + 2l v I for every node p£T with pf4t. 
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Proof. Suppose, towards a contradiction, that in(p) > out(p) + 2b I for some 
node with p -/ft. Recall that A = {—1,0,1} by assumption. If the subtree 

rooted in p has at most 2b I leaves, then we may decrease the input and output 
values of its nodes, retaining the flow conditions, so that the output value of p is 
preserved and its new input value is at most out(p) + 2b'I. Notice that this does 
not modify the main branch since p t. Thus, we get a certificate of strictly 
smaller rank than (T, s, t). Otherwise, the subtree rooted in p has at least 2 b'' + 1 
leaves. Hence, it is not elementary and we may reduce it into a strictly smaller, 
elementary subtree with at most 2b I leaves. The latter induces a complete flow 
tree with the same input and output values for p. Again, this does not modify 
the main branch since p ^ t. Thus, we get a certificate of strictly smaller rank 
than (T,s,t). □ 

Fact 6.5. Let q £ T and let p be the parent of q. If p = t or p -ff t, then 
out.(q) < out(p) + 2' v '. If moreover sym(p) = sym(q), then out(q) < out(p). 

Proof. Assume that p = t or p t. Observe that the children of p are not on the 
main branch, i.e., none of them is a prefix of t. If q is the last child of p, then 
out(q) = out(p) by minimality of (T, s,t). Otherwise, q = pO and pi is the last 
child of p. It holds that outfpO) = in (pi) and out (pi) = out(p) by minimality of 
(T, s,t). We derive from Fact 6.4 that out(q) < out(p) + 2 b L 

Now assume, in addition, that sym(p) = sym(q). Suppose, towards a contra¬ 
diction, that out(q) > out(p). If in(p) < in(q) then we get a certificate (T',p,q) 
of strictly smaller rank than (T, s,t) by setting the input value of t to — oo and 
propagating onwards. If in(p) > in(q) then we may replace, retaining the flow 
conditions since out(q) > out(p ), the subtree rooted in p by the subtree rooted 
in q, and thus get a certificate of strictly smaller rank than (T, s,f). Both cases 
contradict the minimality of ( T,s,t ). It follows that out(q ) < out(p ), which 
concludes the proof of the fact. □ 

Fact 6.6. It holds that out(t) < out(s) < 6|R| ■ 4^ I+ 1 . 

Proof. If in(s) < in(t ) then out(s) = out(t) = —oo by Fact 6.2, so the equality 
out(s) = out(t ) + 1 holds. Otherwise, out(t ) < out(s). If we had out(t) + 1 < 
out(s) then we could decrease the output value of s by one, retaining the flow 
conditions by Fact 6.1, and get a certificate of strictly smaller rank than (T, s, t), 
contradicting the minimality of (T, s,i). Therefore we get that 

out(s) = out(t) + 1 (3) 

and in particular the first inequality of the claim. 

Let us now prove that out(s) < K , where K = 3(|R| + 1)4^ I+ 1 . Suppose, 
towards a contradiction, that out(s) > K. Observe that out(t) > K because 
of Equation (3). Let us consider the subtrees on the right of the branch from 
s to t. The main idea of the proof is to replace these subtrees by smaller ones 
using Theorem 5.2. Formally, let U = {pi € T \ s A p ^ t A pi ^ t}. The 
set U collects the right-children of the main branch from s to the parent of t, 
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excluding those that are on the branch themselves. Let u\,... ,Uk denote the 
elements of U, in lexicographic order, and let Si = sym(ui) for i = l,...,k. 
Note that Si £ V since G is in Chomsky normal form by assumption. Observe 
also that out(s) < outft ) + A# v ..# k due to the flow conditions. It follows from 
out(s) = out(t) + 1 that A# 1 ...# k > 0. 

If the total size of the subtrees rooted in u \,..., it*, is at most I \, then the 
flow conditions entail that the input and output values of their nodes are all 
strictly positive, since out(s) > K and A = {—1,0,1} by assumption. The same 
holds for the output values of the nodes p with s Ap <t. So we may decrease 
all these values by one, retaining the flow conditions. Indeed, Fact 6.1 guarantees 
that the first flow condition still holds for the parent of s. We obtain, in this way, 
a certificate of strictly smaller rank than (7~, s, t). 

Otherwise, the total size of the subtrees rooted in u \,..., Uk is at least K + 1. 
Observe that k < \V\ + 1 by Fact 6.3. According to Theorem 5.2, there exists 
Ti,..., Tfc, where each T,, is a complete parse tree for G[Sj] with yield z,, such 
that \T\\ + • • • + |Tfc| < 3fc4l' / l +1 < K and Y z i ■ ■ - z k > 0- Let us replace the 
subtrees rooted in u\,...,Uk by Ti,...,T/., respectively. Since out(t ) > K, this 
induces a complete flow tree T' = (T', sym ', in', out'), with out'(t) = out(t), and 
satisfying out'(s) = out'(t) + z i ■ ■ ■ z k > out'(t). The new output value of s 
might be smaller, but Fact 6.1 guarantees that the first flow condition still holds 
for the parent of s. The input values of s and t were not changed, so we have 
in'{s) = in(s ) < in(t) = in'(t). Therefore, ( T’,s,t ) is a certificate of strictly 
smaller rank than (T, s, t). 

In both cases, we obtain a contradiction with the minimality of ( T,s,t ). The 
observation that K < Q\V\ ■ 4J V l +1 concludes the proof of the fact. □ 

Fact 6.7. It holds that in(s) < in(t) < 7\V\ ■ 4^ l +1 . 

Proof. We first show that in(s) < in{t) < in(s ) + 1. Recall that in{s) < in(t) 
by definition of certificates. If we had in(s) + 1 < in(t) then we could decrease 
the input value of t by one, retaining the flow conditions by Fact 6.2, and get a 
certificate of strictly smaller rank than (7”, s, t), contradicting the minimality of 
( T,s,t ). Therefore, in(t) < in(s) + 1. 

Observe that t is an internal node since sym(s) = sym(t) cannot be in Au{e}. 
Let us bound the input value of its first child. According to Fact 6.4, it holds that 
in(tj) < out{tj) + 2l v I for each child tj of t. Let k £ {1, 2} denote the number 
of children of t. The flow conditions together with the minimality of (T, s,t) 
guarantee that in(t0) = in(t), out(t ) = out(t(k — 1)), and in(t.(j + 1)) = out(tj) 
for every j = 0,..., k — 1. We derive that in(tO) < outft ) + 2 • 2^ l It follows 
from Fact 6.6 that in(t0) < K + 2^ v l +1 , where K = 6| V| • 4) v \ +1 . 

Let us now prove that in(t) < H, where H = K + 2^ +1 . The proof is 
similar to the proof of Fact 6.6. Suppose, towards a contradiction, that in(t) > H. 
Observe that in(s ) > H. Let us consider the subtrees on the left of the branch 
from s to t. The main idea of the proof is to replace these subtrees by smaller 
ones using Theorem 5.2. Formally, let U = {p0 £ T \ sPp^tApO^t}. The 
set U collects the left-children of the main branch from s to the parent of t, 
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excluding those that are on the branch themselves. Let u\,..., Uk denote the 
elements of U, in lexicographic order, and let Si = sym{ui) for i = 1,..., k. Note 
that Si £ V since G is in Chomsky normal form by assumption. Observe also 
that in(t) < in(s) + A# 1 ...# k . It follows that A# 1 ...# k > 0 and that A# 1 ...# k > 0 
if in(s) < in(t). 

If the total size of the subtrees rooted in u ±,..., Uk is at most A', then the flow 
conditions entail that the input and input values of their nodes are all strictly 
positive, since in(t) > H > K and A = { — 1,0,1} by assumption. The same 
holds for the input values of the nodes p with s A P At. So we may decrease all 
these values by one, retaining the flow conditions. Indeed, the first flow condition 
still holds for t since in(tO) < H. We obtain, in this way, a certificate of strictly 
smaller rank than ifT, s, t). 

Otherwise, the total size of the subtrees rooted in u±,...,Uk is at least 
K + 1. Observe that k < |V| + 1 by Fact 6.3. According to Theorem 5.2, there 
exists Ti,..., I*,, where each 7} is a complete parse tree for G[S}] with yield 
Zi, such that |7i| + • • • + \T^\ < 3fc4l v l +1 < AT and Y z i ■ ■ - z k > 0. Moreover 
Y Zi ... Zk = 0 only if in(s) = in{t). Let us replace the subtrees rooted in 
Ui,...,Uk by T-|,...,Tfc, respectively. Since in(s) > H > AT, this induces a 
complete flow tree T' = (T', sym !, in ', out'), with in'{s) = in{s ), and satisfying 
in'(t) = in'(s) + Y z i ■ ■ ■ z k > in'(s). The first flow condition still holds for t 
since in'(t0) = in(t0) < H < in(s) = in'(s) < in'(t). The output values of s and 
t were not changed. It follows that in' (s) < in'(t) or out'{t) < out'(s). Indeed, 
if in' (s) = in'(t) then Y z i ■ ■ ■ z k =0, hence, in(s) = in(t), which entails that 
out'(t) = out(t) < out(s) = out'{s). Therefore, (T',s,t) is a certificate of strictly 
smaller rank than (T, s, t). 

In both cases, we obtain a contradiction with the minimality of (7”, s, t). The 
observation that H < 7\V\ ■ 4J V l +1 concludes the proof of the fact. □ 

Fact 6.8. It holds that out(p) < 4 2 ^ v l +1 ) for every ancestor p At. 

Proof. Let us write t = sji ■ ■ ■ jk where each £ {0,1}, and let pi = sj\ ■ ■ ■ ji 
for i = 0,..., k. We first show that for every 0 < i < k, 

out(pi) < out(pi- 1 ) + 2 |y| . (4) 

Indeed, the flow conditions together with the minimality of (T, s, t) guarantee, 
for every ancestor s A P A t, that out(p) = out(pl) and that out(p0) = in(pl) if 
pO A t. In the latter case, in(jpl) < out(pl) + 21 v "I by Fact 6.4, hence, out(p0) < 
out(p) + 2l v I. We have thus shown that out{pj) < out(p) + 2l v I for every ancestor 
s ApFt and every j £ {0,1} such that pj A t. 

We derive from Equation (4) that out(pi) < out(s) + *2l y l for all 0 < i < k. 
Recall that k < \V\ + 1 by Fact 6.3. It follows from Fact 6.6 that for every node 
p such that s A P At, we have 

out (p) < out{s ) + |V| • 2' y l < 6|V| • 4' y l +1 + \v\ ■ 2' y ' < 4 2 d y l +1 ) 

According to Fact 6.1, it holds that out{p) = — oo for every proper ancestor p -< s, 
which concludes the proof of the fact. □ 
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Fact 6.9. It holds that in(p) < 4 2 ^ v l +1 ) for every ancestor e -< p < t. 

Proof. Let us write t = j i • • • jk where each ji £ {0,1}, and let Pi = j i • • • ji for 
i = 1,..., k. We claim that in(pi- 1 ) < in(pi) + 2^' I for every 1 < i < k. Indeed, 
the flow conditions together with the minimality of (' T , s , t) guarantee, for every 
ancestor e -< p -< t, that in(p) = in(p0) and that out(p0) = in(pl) if pi ^ t. In 
the latter case, in(pO) < out(p0) + 2^ I by Fact 6.4, hence, in(p) < in(pl) + 2^ L 
We have thus shown that in(p) < in(pj) + 2^1 for every ancestor e -< p -< t and 
every j £ {0,1} such that pj ^ t. This concludes the proof of the claim. 

Observe that s = j i ■ ■ ■ jh where h = |s|. We derive from the claim that, 
firstly, in(pi ) < in(s) + (h — i)2' v ' for all 0 < i < h, and secondly, in(pi) < 
in(t) + (k — i) 2^1 for all h < i < k. Recall that h < \V\ and (fc — h) < \V\ + 1 
by Fact 6.3. It follows from Fact 6.7 that, for every ancestor e p ^ t, we have 

in(p) < max{m(s), in{t)} + |W121 v I < 7\V\ ■ 4^ +1 + |R|2l v I < 4 2 ^ v 
which concludes the proof of the fact. □ 


